AITopics | localization map

Reducing Information Bottleneck for Weakly Supervised Semantic Segmentation

Neural Information Processing SystemsNov-15-2025, 23:19:47 GMT

Our experimental evaluations demonstrate that this simple modification significantly improves the quality of localization maps on both the P ASCAL VOC 2012 and MS COCO 2014 datasets, exhibiting a new state-of-the-art performance for weakly supervised semantic segmentation.

artificial intelligence, machine learning, segmentation, (16 more...)

Neural Information Processing Systems

Country: Asia > South Korea > Seoul > Seoul (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

66738d21d3cddb8717ca52deff5a5546-Supplemental-Conference.pdf

Neural Information Processing SystemsNov-14-2025, 16:25:31 GMT

artificial intelligence, localization map, machine learning, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.73)

Add feedback

66738d21d3cddb8717ca52deff5a5546-Paper-Conference.pdf

Neural Information Processing SystemsNov-14-2025, 16:25:27 GMT

artificial intelligence, machine learning, segmentation, (10 more...)

Neural Information Processing Systems

Country:

Asia > China > Guangdong Province > Shenzhen (0.04)
Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)

Genre: Research Report (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Vision (0.98)
Information Technology > Sensing and Signal Processing > Image Processing (0.94)

Add feedback

Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching

Neural Information Processing SystemsNov-14-2025, 06:12:32 GMT

In this paper, we propose a two-stage learning framework to perform self-supervised class-aware sounding object localization. First, we propose to learn robust object representations by aggregating the candidate sound localization results in the single source scenes. Then, class-aware object localization maps are generated in the cocktail-party scenarios by referring the pre-learned object knowledge, and the sounding objects are accordingly selected by matching audio and visual object category distributions, where the audiovisual consistency is viewed as the self-supervised signal. Experimental results in both realistic and synthesized cocktail-party videos demonstrate that our model is superior in filtering out silent objects and pointing out the location of sounding objects of different classes.

artificial intelligence, localization, machine learning, (16 more...)

Neural Information Processing Systems

Country:

Asia > China > Beijing > Beijing (0.04)
North America > Canada (0.04)
Asia > China > Shanghai > Shanghai (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)

Add feedback

RGC: a radio AGN classifier based on deep learning. I. A semi-supervised model for the VLA images of bent radio AGNs

Hossain, M. S., Shahal, M. S. H., Khan, A., Asad, K. M. B., Saikia, P., Akter, F., Ali, A., Amin, M. A., Momen, A., Hasan, M., Rahman, A. K. M. M.

arXiv.org Artificial IntelligenceOct-28-2025

Wide-angle tail (WAT) and narrow-angle tail (NAT) radio active galactic nuclei (RAGNs) are key tracers of dense environments in galaxy groups and clusters, yet no machine-learning classifier of bent RAGNs has been trained using both unlabeled data and purely visually inspected labels. We release the RGC Python package, which includes two newly preprocessed labeled datasets of 639 WATs and NATs derived from a publicly available catalog of visually inspected sources, along with a semi-supervised RGC model that leverages 20,000 unlabeled RAGNs. The two labeled datasets in RGC were preprocessed using PyBDSF which retains spurious sources, and Photutils which removes them. The RGC model integrates the self-supervised framework BYOL (Bootstrap YOur Latent) with the supervised E2CNN (E2-equivariant Convolutional Neural Network) to form a semi-supervised binary classifier. The RGC model, when trained and evaluated on a dataset devoid of spurious sources, reaches peak performance, attaining an accuracy of 88.88% along with F1-scores of 0.90 for WATs and 0.85 for NATs. The model's attention patterns amid class imbalance suggest that this work can serve as a stepping stone toward developing physics-informed foundation models capable of identifying a broad range of AGN physical properties.

artificial intelligence, deep learning, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2510.2219

Country:

Oceania > Australia (0.04)
Asia > Bangladesh > Dhaka Division > Dhaka District > Dhaka (0.04)
Africa > South Africa (0.04)
(5 more...)

Genre: Research Report (1.00)

Industry: Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching

Neural Information Processing SystemsOct-3-2025, 05:51:13 GMT

In this paper, we propose a two-stage learning framework to perform self-supervised class-aware sounding object localization. First, we propose to learn robust object representations by aggregating the candidate sound localization results in the single source scenes. Then, class-aware object localization maps are generated in the cocktail-party scenarios by referring the pre-learned object knowledge, and the sounding objects are accordingly selected by matching audio and visual object category distributions, where the audiovisual consistency is viewed as the self-supervised signal. Experimental results in both realistic and synthesized cocktail-party videos demonstrate that our model is superior in filtering out silent objects and pointing out the location of sounding objects of different classes.

artificial intelligence, localization, machine learning, (16 more...)

Neural Information Processing Systems

Country:

Asia > China > Beijing > Beijing (0.04)
North America > Canada (0.04)
Asia > China > Shanghai > Shanghai (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)

Add feedback

e6384711491713d29bc63fc5eeb5ba4f-Paper.pdf

Neural Information Processing SystemsAug-22-2025, 01:24:07 GMT

artificial intelligence, machine learning, segmentation, (15 more...)

Neural Information Processing Systems

Country: Asia > South Korea > Seoul > Seoul (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Expansion and Shrinkage of Localization for Weakly-Supervised Semantic Segmentation Anonymous Author(s) Affiliation Address email A Appendix 1 A.1 Additional Analysis 2

Neural Information Processing SystemsAug-15-2025, 11:13:53 GMT

We conduct an ablative study that we allow these weights to be updated as well.

artificial intelligence, localization map, machine learning, (10 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.73)

Add feedback

66738d21d3cddb8717ca52deff5a5546-Paper-Conference.pdf

Neural Information Processing SystemsAug-15-2025, 11:13:49 GMT

artificial intelligence, machine learning, segmentation, (11 more...)

Neural Information Processing Systems

Country:

Asia > China > Guangdong Province > Shenzhen (0.04)
Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)

Genre: Research Report (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Vision (0.98)
Information Technology > Sensing and Signal Processing > Image Processing (0.94)

Add feedback

Developing an AI-Guided Assistant Device for the Deaf and Hearing Impaired

Jiayu, null, Liu, null

arXiv.org Artificial IntelligenceJul-22-2025

This study aims to develop a deep learning system for an accessibility device for the deaf or hearing impaired. The device will accurately localize and identify sound sources in real time. This study will fill an important gap in current research by leveraging machine learning techniques to target the underprivileged community. The system includes three main components. 1. JerryNet: A custom designed CNN architecture that determines the direction of arrival (DoA) for nine possible directions. 2. Audio Classification: This model is based on fine-tuning the Contrastive Language-Audio Pretraining (CLAP) model to identify the exact sound classes only based on audio. 3. Multimodal integration model: This is an accurate sound localization model that combines audio, visual, and text data to locate the exact sound sources in the images. The part consists of two modules, one object detection using Yolov9 to generate all the bounding boxes of the objects, and an audio visual localization model to identify the optimal bounding box using complete Intersection over Union (CIoU). The hardware consists of a four-microphone rectangular formation and a camera mounted on glasses with a wristband for displaying necessary information like direction. On a custom collected data set, JerryNet achieved a precision of 91. 1% for the sound direction, outperforming all the baseline models. The CLAP model achieved 98.5% and 95% accuracy on custom and AudioSet datasets, respectively. The audio-visual localization model within component 3 yielded a cIoU of 0.892 and an AUC of 0.658, surpassing other similar models. There are many future potentials to this study, paving the way to creating a new generation of accessibility devices.

artificial intelligence, localization model, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2507.14215

Genre: Research Report > New Finding (0.47)

Industry: Health & Medicine (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Collaborating Authors

localization map

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Reducing Information Bottleneck for Weakly Supervised Semantic Segmentation

66738d21d3cddb8717ca52deff5a5546-Supplemental-Conference.pdf

66738d21d3cddb8717ca52deff5a5546-Paper-Conference.pdf

Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching

RGC: a radio AGN classifier based on deep learning. I. A semi-supervised model for the VLA images of bent radio AGNs

Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching

e6384711491713d29bc63fc5eeb5ba4f-Paper.pdf

Expansion and Shrinkage of Localization for Weakly-Supervised Semantic Segmentation Anonymous Author(s) Affiliation Address email A Appendix 1 A.1 Additional Analysis 2

66738d21d3cddb8717ca52deff5a5546-Paper-Conference.pdf

Developing an AI-Guided Assistant Device for the Deaf and Hearing Impaired